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WHAT IS CLAIMED IS : 

1. A method for automatically determining at least one modal value of 
non-numeric data comprises: 

selecting a data subset from a dataset, the data subset comprising at least a 
portion of the dataset and including at least one non-numeric value; and 

automatically determining at least one modal value based on the selected data 

subset. 

2. The method of Claim 1, wherein selecting the data subset from the 
dataset comprises querying a database. 

3. The method of Claim 1, each value of the data subset comprising one 
of the following data types: 

float; 
integer; 
currency; 
date; 

decimal; or 
string. 

4. The method of Claim 1, wherein determining at least one modal value 
based on the selected data subset comprises: 

sorting the selected data subset by value; 

processing the sorted data subset to identify one or more modal groups, each 
modal group comprising one or more instances of a substantially identical value; and 

determining at least one modal value based, at least in part, on the one or more 
modal groups. 
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5. The method of Claim 4 further comprising determining a modal count 
for each modal group, each modal count comprising the number of instances of the 
substantially identical value in the associated modal group. 

6. The method of Claim 5, wherein determining at least one modal value 
based, at least in part, on the one or more modal groups comprises: 

determining a highest one or more modal counts; 

selecting the substantially identical value from each modal group associated 
with the highest model count; and 

assigning each selected substantially identical value to one modal value. 

7. The method of Claim 5, in response at least in part to each modal count 
being equal to one, assigning a null value to one modal value. 

8. The method of Claim 4, one of the modal groups comprising at least 
one lowercase string value and at least one mixed-case string value. 

9. The method of Claim 1, wherein determining at least one modal value 
based on the selected data subset comprises: 

selecting one data object from the data subset; 

comparing a value of the data object to a plurality of stored values in a lookup 
table, each stored value being associated with one modal count; 

in response, at least in part, to the value of the data object being located in the 
plurality of stored values, adding one to the associated modal count; 

selecting the highest one or more modal counts from the lookup table; and 

assigning each stored value associated with one of the highest modal counts to 
one modal value. 
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10. Software for automatically determining at least one modal value of 
non-numeric data operable to: 

select a data subset from a dataset, the data subset comprising at least a portion 
of the data set and including at least one non-numeric value; and 

automatically determine at least one modal value based on the selected data 

subset. 



11. The software of Claim 10, wherein the software operable to select the 
data subset of the dataset comprises software operable to query a database. 

12. The software of Claim 10, each value of the data subset comprising 
one of the following data types; 

float; 
integer; 
currency; 
date; 

decimal; or 
string. 



13. The software of Claim 10, wherein the software operable to determine 
at least one modal value based on the selected data subset comprises software 
operable to: 

sort the selected data subset by value; 

process the sorted data subset to identify one or more modal groups, each 
modal group comprising one or more instances of a substantially identical value; and 

automatically determine at least one modal value based, at least in part, on the 
one or more modal groups. 
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14. The software of Claim 13 further operable to determine a modal count 
for each modal group, each modal count comprising the number of instances of the 
substantially identical value in the associated modal group. 

15. The software of Claim 14, wherein the software operable to detennine 
at least one modal value based, at least in part, on the one or more modal groups 
comprises software operable to: 

determine a highest one or more modal counts; 

select the substantially identical value from each modal group associated with 
tiie highest model count; and 

assign each selected substantially identical value to one modal value. 

16. The software of Claim 14, in response at least in part to each modal 
count being equal to one, further operable to assign a null value to one modal value. 



1 7. The software of Claim 1 3, one of the modal groups comprising at least 
one lowercase string value and at least one mixed-case string value. 
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18. The software of Claim 10, wherein the software operable to determine 
at least one modal value based on the selected data subset comprises software 
operable to: 

selecting one data object from the data subset; 

comparing a value of the data object to a pluraUty of stored values in a lookup 
table, each stored value being associated with one modal count; 

in response to the value of the data object being located in the pluraUty of 
stored values, adding one to the associated modal count; 

selecting the highest one or more modal counts from the lookup table; and " 

assigning each stored value associated with one of the highest modal counts to 
one modal value. 



ATTORNEY DOCKET NO. : PATENT APPLICATION 

063170.2575 (20000298) 

20 

19. System for automatically deteraiining at least one modal value of non- 
numeric data comprises: 

memory operable to store a data set, the data set comprising a plurality of data 
objects and each data object comprising a data type and a value; and 
5 one or more processors operable to: 

select a data subset from the dataset, the data subset comprising at least 
a portion of the plurality of data objects and including at least one non-nmneric data 
object; and 

automatically determine at least one modal value based on the selected 

10 data subset. 

20. The system of Claim 19, wherein the processors operable to select the 
data subset of the dataset comprise processors operable to query a database. 

15 21. The system of Claim 19, each data object comprising one of the 

following data types: 

float; 

integer; 

currency; 
20 date; 

decimal; or 

string. 
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22. The system of Claim 19, wherein the processors operable to determine 
at least one modal value based on the selected data subset comprise processors 
operable to: 

sort the selected data subset by value; 

process the sorted data subset to identify one or more modal groups, each 
modal group comprising two or more instances of a substantially identical value; and 

automatically determine at least one modal value based, at least in part, on the 
one or more modal groups. 

23. The system of Claim 22, the processors further operable to determine a 
modal count for each modal group, each modal count comprising the number of 
instances of the substantially identical value in the associated modal group. 

24. The system of Claim 23, wherein the processors operable to determine 
at least one modal value based, at least in part, on the one or more modal groups 
comprise processors operable to: 

determine a highest one or more modal counts; 

select the substantially identical value from each modal group associated with 
the highest model coimt; and 

assign each selected substantially identical value to one modal value. 

25. The system of Claim 23, in response at least in part to each modal 
count being equal to one, the processors further operable to assign a null value to one 
modal value. 



26. The system of Claim 22, one of the modal groups comprising at least 
one lowercase string value and at least one mixed-case string value. 
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27. The system of Claim 19, wherein the processors operable to determine 
at least one modal value based on the selected data subset comprise processors 
operable to: 

selecting one data object from the data subset; 

comparing a value of the data object to a plurality of stored values in a lookup 
table, each stored value being associated with one modal count; 

in response to the value of the data object being located in the plurality of 
stored values, adding one to the associated modal coimt; 

selecting the highest one or more modal counts from the lookup table; and 
assigning each stored value associated with one of the highest modal counts to one 
modal value. 
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28. A system for automatically determining at least one modal value of 
non-numeric data comprises: 

means for selecting a data subset from a dataset, the data subset comprising at 
least a portion of the dataset and including at least one non-numeric value; and 

means for automatically determining at least one modal value based on the 
selected data subset. 



